Using data mining and OLAP to discover patterns in a database of patients with Y-chromosome deletions

نویسندگان

  • Saso Dzeroski
  • Dimitar Hristovski
  • Borut Peterlin
چکیده

The paper presents a database of published Y chromosome deletions and the results of analyzing the database with data mining techniques. The database describes 382 patients for which 177 different markers were tested: 364 of the 382 patients had deletions. Two data mining techniques, clustering and decision tree induction were used. Clustering was used to group patients according to the overall presence/absence of deletions at the tested markers. Decision trees and On-Line-Analytical-Processing (OLAP) were used to inspect the resulting clustering and look for correlations between deletion patterns, populations and the clinical picture of infertility. The results of the analysis indicate that there are correlations between deletion patterns and patient populations, as well as clinical phenotype severity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

O-1: Evaluation of Ethnic Patterns of Y Chromosome Microdeletions in Iranian Infertile Men with Azoospermia/Severe Oligospermia Referred to Royan Institute

Background: Microdeletions of the long arm of the chromosome Y are the most common molecular genetic cause of severe infertility in men which affect three regions of AZFa, AZFb and AZFc (Azoospermia factor). These regions contain various genes involved in spermatogenesis. The effect of ethnicity on the patterns of Y chromosome microdeletions has not been extensively studied, particulary in Iran...

متن کامل

SUBMICROSCOPIC DELETIONS OF THE Y CHROMOSOME ARE NOT LIMITED TO AZOOSPERMIC MEN, BUT ARE ALSO DETECTED IN INFERTILE MEN WITH IDIOPATHIC OLIGOZOOSPERMIA

It is now agreed that 10-25% of infertile men with azoospermia have submicroscopic deletions of the Y chromosome long ann (yq), consistent with the proposed location of the azoospermia locus (AZF) in Yq 11.23. However, it is not known whether Yq microdeletions are unique to men with azoospermia or whether they are also observed in infertile men with less severe defects of spermatogenesis (o...

متن کامل

Prediction and Diagnosis of Diabetes Mellitus using a Water Wave Optimization Algorithm

Data mining is an appropriate way to discover information and hidden patterns in large amounts of data, where the hidden patterns cannot be easily discovered in normal ways. One of the most interesting applications of data mining is the discovery of diseases and disease patterns through investigating patients' records. Early diagnosis of diabetes can reduce the effects of this devastating disea...

متن کامل

Data sanitization in association rule mining based on impact factor

Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...

متن کامل

Molecular Study of Partial Deletions of AZFc Region of the Y Chromosome in Infertile Men

Background & Aims: The most significant cause of infertility in men is the genetic deletion in the azoospermia factor (AZF) region that is caused by the process of intra- and inter-chromosomal homologous recombination in amplicons. Homologous recombination could also result in partial deletions in AZF region. The aim of this research was to determine the association between the partial AZFc del...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings. AMIA Symposium

دوره   شماره 

صفحات  -

تاریخ انتشار 2000